Lower Bounds for Learning Discrete Distributions

نویسنده

  • Rocco A. Servedio
چکیده

We study the model of learning discrete probability distributions which was introduced by Kearns, Mansour, Ron, Rubinfeld, Schapire, and Sellie in 13]. Our main results are a lower bound on computational sample complexity and a lower bound on hypothesis size for this learning model. We deene a class of eeciently learnable distributions which has the following interesting property: while a computationally unbounded learning algorithm can learn the class from O(1) examples, the computational sample complexity of the class is essentially (1==) in the sense that any polynomial-time learning algorithm must use at least this many examples. The construction is based on the existence of any length-preserving one-way permutation. We also prove a lower bound on hypothesis size. Based on the existence of any one-way function, we construct a class of probability distributions learnable in polynomial time from ~ O(1== 2) examples and prove that any polynomial-time learning algorithm for this class must use hypotheses of size essentially (1== 1=2): This answers a conjecture posed by Kearns et al. in 13]. Note: After completing a draft of this paper, it was brought to our attention that Moni Naor 16] had already established, via a slightly diierent construction, results which are essentially equivalent to our main result of Section 5 (the lower bound on hypothesis size). We thank the unknown referee who brought 16] to our attention.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On discrete a-unimodal and a-monotone distributions

Unimodality is one of the building structures of distributions that like skewness, kurtosis and symmetry is visible in the shape of a function. Comparing two different distributions, can be a very difficult task. But if both the distributions are of the same types, for example both are unimodal, for comparison we may just compare the modes, dispersions and skewness. So, the concept of unimodali...

متن کامل

Exact maximum coverage probabilities of confidence intervals with increasing bounds for Poisson distribution mean

 ‎A Poisson distribution is well used as a standard model for analyzing count data‎. ‎So the Poisson distribution parameter estimation is widely applied in practice‎. ‎Providing accurate confidence intervals for the discrete distribution parameters is very difficult‎. ‎So far‎, ‎many asymptotic confidence intervals for the mean of Poisson distribution is provided‎. ‎It is known that the coverag...

متن کامل

Communication-Efficient Distributed Learning of Discrete Distributions

We initiate a systematic investigation of distribution learning (density estimation) when the data is distributed across multiple servers. The servers must communicate with a referee and the goal is to estimate the underlying distribution with as few bits of communication as possible. We focus on non-parametric density estimation of discrete distributions with respect to the `1 and `2 norms. We...

متن کامل

Minimax Estimator of a Lower Bounded Parameter of a Discrete Distribution under a Squared Log Error Loss Function

The problem of estimating the parameter ?, when it is restricted to an interval of the form , in a class of discrete distributions, including Binomial Negative Binomial discrete Weibull and etc., is considered. We give necessary and sufficient conditions for which the Bayes estimator of with respect to a two points boundary supported prior is minimax under squared log error loss function....

متن کامل

Faithfulness and learning hypergraphs from discrete distributions

The concepts of faithfulness and strong-faithfulness are important for statistical learning of graphical models. Graphs are not sufficient for describing the association structure of a discrete distribution. Hypergraphs representing hierarchical log-linearmodels are considered instead, and the concept of parametric (strong-)faithfulnesswith respect to a hypergraph is introduced. The strength of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007